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tranafom.it Ion  and  the  general  least-squares  procedure  for 
^unctions  having  nonlinear  parameters  are  described  In  detail. 
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regression,  frequency  functions,  and  polynomials.  References 
arc  provided  that  describe  Che  methodology  In  greater  detail. 
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I.  INTRODUCTION 


in  the  analysis  of  experimental  data,  it  is  often 
a  .ruthematical  expression  of  the  relationship 

determination  of  an  empirical  fonm.la.  JJe  Kcal 

fit  for  the  plotted  data,  depends  upon  an  understanding  o.  the  phjsic 

significance  of  the  data  and  a  knowledge  of  the  8*"*”^ 
varietv  of  curves.  Once  the  equation  of  a  curve  has 

set  of  data,  the  constants  of  the  equation  must  ^ 

the  methods  available  for  fitting  a  curve  to  experimental  data. 


The  ob*«ctive  of  this  paper  is  to  present  in  as  concise  a  fora  as  pos¬ 
sible  methods  for  handling  a  wide  variety  of  functions, 

nvore  co4le*  I"  int.re.t  of  ^..plng  the 

to  follw,  proofs  and  numerical  examples  of  the  ‘h* 

K^r^^ied  It  is  recousMAded  thet  the  reader  refer  directly  to  the 
U^r'^ri  1;r  further  detail,  on  the  method  he  flnelly  decide,  to  employ. 
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II.  STRAIGHT-LINE  TRANSFORMmONS 


A.  INTRODUCTION 

If  the  experimental  data,  when  plotted  on  arithmetic  coordinate  paper, 
do  not  approximate  a  straight  line  y  «  a  -h  bx  but  do  seem  to  approximate  a 
sniooth  curve,  the  shape  of  the  curve  and  an  understanding  of  the  nature  of 
the  experiment  may  suggest  the  equation  of  the  curve  that  will  fit  the 
points  most  closely.  A  very  helpful  way  of  verifying  the  appropriateness 
of  a  particular  equation  for  a  set  of  data  is  to  transform  the  data  in  such 
a  way  that  a  straight-line  plot  y'  <■  a*  -f  bx'  is  obtained.  If  a  straight 
line  is  achieved  through  some  transformation,  the  equation  corresponding  to 
the  transformation  will  be  known  to  be  satisfactory  and,  furthermore,  the 
parasttters  of  the  equation  can  be  evaluated  directly  from  the  linear  form 
of  the  equation. 

Table  I  is  furnished  as  a  quick  reference  to  the  methods  of  straight- 
line  transformation  available  for  fitting  curves  to  experimental  data.  A 
collection  of  graphs  (Figure  1)  is  included  to  illustrate  the  effect  of 
variation  of  the  parameters  (constants)  in  the  equations  presented  in  the 
table.  The  general  techniques  of  linear  transformation  displayed  in  the 
table  are  outlined  in  Section  II.  For  numerical  examples  and  more  detailed 
explanations  of  the  methods  of  transformation,  the  reader  is  directed  Co 
the  iiteraCure  Cited. Joh.troi.^  treats  Che  widest  variety  of  general 
types  of  equations,  and  his  t>ook  would  be  recommended  as  a  first  reference. 
The  other  books  treat  much  of  the  same  material  as  Johnson  does,  but  DeviS* 
and  Running^  present,  in  addition,  a  few  of  the  more  specialised  procedures 
ctiat  are  given  in  Table  X. 


I.  CFNEIAL  FROCeOUICS  FOR  (.XMCAR  TRAKSFOCMATIOM 

1.  Reciprocals  or  logerithme  [Equations  (1),  (2),  .}),  (4),  end  (S)  of 
Table  ll 

Logarithms  or  reciprocals  of  one  or  both  of  Che  variables  may  be 
taken  before  the  values  are  plotted  on  arithmetic  coordinate  paper.  Alter¬ 
natively,  logarithmic  or  reciprocal  paper  nay  be  used  to  eliminate  the  need 
for  making  these  additional  calculations. 

2.  Selection  of  a  Point  on  the  Experimental  Curve  [Equations  (Ja),  (6), 
and  (7)  cf  Table  l} 

A  particular  point  (xj,  y^)  on  the  experimental  curve  may  be  used 
to  reduce  the  original  equation  to  a  linear  form  in  two  paramaters.  Occa¬ 
sionally,  more  than  one  point  ia  used  in  this  way. 

).  Combination  of  Methods  (1)  and  (2)  [Equations  (4a),  (4b),  (Sa),  (Sb), 
and  (4a)  of  Table  l] 


Combination  of  Two  Simple  Curves 


(a)  Sum  of  two  simple  equations,  each  of  v/hich  influences  a  parti- 
iMilar  portion  of  the  curve  (Methoi  '4  of  Table  1). 

When  a  portion  of  the  data  appears  C<\  be  linear  on  soiae  type  of 
coordinate  paper,  wnereas  the  remainder  shows  a  distinct  curvature,  the 
linear  portion  may  be  fitted  bv  the  appropriate  formula,  and  •'hen  the 
deviations  of  the  rest  of  the  data  from  this  straight  line  may  be  fitted  by 
a  second  function.  The  final  equation  will  be  the  sum  of  the  two  functions, 
each  function  being  negligible  for  the  portion  of  the  curve  where  the  other 
function  has  the  siost  Influence. 

(b)  Sun  of  two  simple  equations,  bo'h  of  which  exert  Influence  over 
the  entire  range  of  the  curve  [Equations  (10),  (11),  and  (12)  of  Table  l]. 

Although  no  portion  of  the  plotted  cur^'e  is  linear  on  any  of  the 
types  of  coordinate  paper,  sosm  curvea  may  be  represented  by  a  sum  of 
simple  equations.  In  thla  case,  ratloa  of  the  successive  values  of  y  ars 
plotted  on  arithmetic  coordinate  paper  to  obtain  a  'straight  line  th't  will 
give  the  valuee  of  two  o*  the  paremeters.  The  remaining  two  parameters 
can  then  be  evaluated  from  the  original  equation. 

(c)  Two  separata  equationt,  each  valid  for  a  restrlctad  range  of 
'he  variables. 

When  It  Is  Impossible  to  repreaent  e  s<tt  of  data  by  an  aquation 
Involving  few  conatanit,  it  la  somatlmas  advtaabla  to  fit  difftrent 
equations  to  distinct  portions  of  the  curve.* 


C.  OETtnaKATION  or  THE  COMSTAIftS 

0r.ee  (he  equation  H«e  been  selected,  there  err  eeverel  ways  In  which 
the  parameters  of  the  curve  may  be  epproalmated,  nependlng  on  the  degree 
of  accuracy  required,  reference  will  be  made  later  in  the  paper  to  the 
principle  of  sMximum  likelihood  and  the  method  of  mosmnts,  which  are  useful 
In  the  estimation  ot  peraiMtert  for  certain  types  of  functions.  The  three 
most  common  methods  of  peramate;  estimation,  given  In  the  order  of  the 
accuracy  afforded,  are  the  method  of  least  squares,  the  method  of  averages, 
and  the  aathod  of  selected  points,  rince  these  three  methods  ere  fully 
described  In  etandard  texibnoks,  theta  le  no  need  to  glee  e  detelled  ax- 
plenatlon  of  them  here.  The  amthod  of  least  squares  gives  the  best  fit 
end  Is  generelly  relied  upon  for  accurate  asttmatee  of  the  parameters. 
Nodem-day  computing  fscllltlea  relieve  the  problems  once  Impoeed  by  the 
Isbocioue  computations  •-equlred  In  the  least •squares  proceas.  A  modifies* 
ti.  of  the  method  of  averages  has  been  reported  by  Mir  end  Shrlvestava*  . 
Their  method  of  group  averages  can  be  employed  to  give  greater  precision 
than  would  be  furnished  by  the  method  of  sveregea  with  Ipes  trlthmatlc 
labor  than  would  be  required  by  the  method  of  least  squares. 


TABLE  I.  STRAIGHT -LIKE  TRANSFORMATIONS 
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(k).  a  |Miti 
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y  " 


A. 


B. 


C. 


D. 


_ X 

a  +  bx  +  ex'! 

^  ”  .2  -  .  lx  +  .05x'^ 

y  “  .3  -  .2x  +  .1x5 

y  “  . 2  +  .lx  *■  .05x^ 

X _ 

y  "  .3  +  .2x  +  .Ix^ 


y 


1. 


D. 

E. 


F. 


10‘ 


a  +  bx  +  ex* 


y  .  10-75  -  -SOx-e  .03x2 


y  ■ 
y  “ 
y  ■ 
y  ■ 
y  * 


J0.375  -  .15x  +  .015x2 
JQ.IS  -  .lOx  .01x2 
JO-  .25  .lOx  -  .01x2 
JO-  .375  +  .15x  -  .015x2 
jg.  .75  -»■  .30x  -  .03x2 


y  ■  a  b  loB  X  ♦  c  10*2  n 

A.  y  •  1.2  lot  X  1.2  lot*  x 

B.  y  -  1.2  ■»  lot  X  .12  lot*  » 

C.  y  •  1.2  -  lot  X  ♦  1.2  la*2  « 

D.  y  -  1.2  -  lot  X  ■»  .12  lot*  X 


Equation  (6d) 


Equation  (b{) 


Fltur*  1.  Conctoutd. 


tf  ^  ^ 

O  W  > 
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The  method  of  selected  points,  though  least  accurate,  is  very  simple 
and  can  be  used  to  get  an  immediate  rough  idea  of  the  nature  of  the  para¬ 
meters.  In  employing  this  method,  the  straight  line  is  usually  fitted 
manually  to  the  plotted  points.  The  arbitrary  eye  fit  is  the  main  factor 
contributing  to  the  inaccuracy  of  thit  .method  of  estimating  the  parameters. 
Askovitz^  has  described  a  very  interesting  method  of  fitting  a  least- 
squares  straight  line  to  a  series  of  points  by  a  method  that  is  entirely 
graphical.  Since  this  method  is  not  '.^ell  known  and  can  considerably  improve 
the  accuracy  of  the  method  of  selected  points  (provided  only  that  the 
X  values  of  the  points  are  equally  spaced),  it  will  be  described  briefly 
in  the  next  section.  For  proof  of  the  method,  the  reader  should  refer  to 
the  original  work. 


D.  SHORT-CUT  GRAPHIC  METHOD  FOR  FITTING  THE  BEST  STRAIGHT  LINE  TO  A  SERIES 

OF  POINTS  ACCORDING  TO  THE  CRITERION  OF  LEAST  SQUARES* 

Given;  Five  points —A,  B,  C,  D,  E,  (or  any  number  of  points),  with 
values  of  x  at  equal  intervals  A  x.  Consider  the  value  of  A  x  to  be  a 
spacing  unit. 

Method: 

(a)  Place  straightedge  so  that  it  joins  A  and  B  (Figure  2).  Draw 
a  straight  line  from  A  to  a  point  2/3  of  a  spacing  unit  farther.  The  point 
found  is  B' .  Note  that  B'  lies  on  a  vertical  line  2/3  A  x  to  the  right 
of  A. 


(b)  Place  straightedge  so  that  It  joins  B'  to  C.  Starting  at  B', 
draw  a  straight  line  2/3  of  a  spacing  unit  farther.  The  point  found  is 
C',  and  lies  on  a  vertical  line  2/3  A  x  to  the  right  of  B* . 

(e)  Continue  In  this  manner.  Call  the  la*'t  point  found  T. 

(d)  Now  start  at  E  and  go  backwards  in  the  same  manner.  Call  the 
last  point  found  U. 

(e)  tft  is  the  line  of  best  fit  by  least  squares  for  the  points 

A  to  E. 

(f)  As  a  check  on  the  procedure,  x  values  of  A,  U,  T,  and  E  should 
be  found  to  be  equally  spaced. 


*  "Advancing  Centroids  Technique"  (condensed  from  a  paper  by  Askovitt''). 
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Figure  2.  Graphic  ifechoe  of  Fitting  Straight  ^Inc. 


E.  EXPUMATIW  OF  TABU  1 

For  each  of  the  a^uaelooa  atatad  la  tha  flrat  caluan  of  Tobla  X,  tha 
second  coluan  gives  tha  coordinataa  that  will  yield  a  straight  line  on 
arithaatlc  coordinate  paper  whan  the  aquation  la  appropriate  for  tha  given 
data.  ColusMi  3  gives  the  corraspondlng  linear  aquation  froa  which  tha 
paraMtara  of  the  function  my  ha  dataralnad.  Ooa  rafer*r.ca  for  each 
■athod  Is  cited  In  tha  laat  coluan. 

Tha  aost  cBion  aathods  of  linear  transforvatlon  ar>  described  In  Itaaa 
(1)  through  (3)  of  Table  1.  FaalUarlty  with  thaae  tec:>ntquas  will  assist 
tha  reader  in  understanding  tha  later  entries  in  the  ta  de.  Itaa  (6) 
describes  a  aathod  for  transfoialng  a  parabola  to  linear  fora.  A  nuabar  of 
variations  of  the  basic  equation  for  a  parabola  are  glvi^n  In  Itaas  <ba) 
through  (6g).  Once  these  variations  have  been  reduced  '.o  tha  basic  parabola 
equation  by  one  of  tha  transfocaatloos  oucllnad  In  itaaa  (1)  throu^  (S), 
they  any  be  handled  by  tha  general  aathod  for  a  parabola  as  eaplalnad  In 
Itea  (6). 

Deteralnation  of  the  paraastars  in  Equations  (l>  through  (13)  requires 
tha  use  of  two  linear  equations.  Tha  flrat  linear  aquation  yields  astlaatas 
of  two  of  the  paraaetars,  which  are  used  subsequently  la  tha  second  equation 
to  find  the  raaslnder  of  the  paraastars.  Tha  aathod  presented  In  ttea  (U) 
of  tha  table  Is  appropriate  'vily  when  a  portion  of  the  plotted  data  la 
linear  either  lansdlately  or  after  soat  t tens foraat ion  has  been  parforaed. 

It  should  be  noted  that  Equations  (i),  (10),  and  (12)  any  be  handled  by 
aore  general  aethods  lAien  the  special  condition  of  partial  linearity 
necessary  for  Method  (lA)  la  not  present. 
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Wlu  1  the  symbols  A  x  and  A  y  appear  iu  items  (6)  through  (11),  they 
signify  the  differences  between  successive  values  of  the  data,  so  that 
X  =  1  ■  ’‘k  y  ~  yk  +  1  ■  ''k-  synnbol  A  y  in 

Method  (14)  is  explained  in  the  table. 

A  comment  is  needed  concerning  Equation  (I4i)  in  the  table, 

u 


V  =  (a  +  bx)  + 
The  final  term,  6 


2  cosh  C  n(X  -  x)] 

which  can  also  be  written  in  the  form 


2  cosh  [n(X  -  x)] 


-sr-  ,  Introduces  a  bump  into  the  curve.  The  signifi- 


e  U, 

6  »  .  ■■  .-I-. 

en(X  -  x)  4-  en(x  -  ^ 

cance  of  the  parasteters  X  and  a  is  explained  in  the  table.  The  effect  that 
this  term  has  when  added  to  equations  for  a  straight  line  and  for  a  parabola 
is  Illustrated  in  Figures  3  and  4. 


y  •  .5x^  -  5x  ♦  13.5  ♦  6 

•  »  _ 

I-  » 


a  •  ♦ 
n  •  4 


1  cosh  [ n  (5  -  a) j 
^  *  2  cosh  [4  (8  -  x)!l 


Figure  3.  Graph  of  Variation  of  Equation  (Ui). 
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111.  METHOD  OF  DIFFERENTUL  CORRECTION 


A .  INTRODUCTION 

Equations  chat  contain  nonlinear  parameters,  and  for  which  no  trans¬ 
formation  will  afford  a  linear  relation  among^  the  parameters,  canr.ot  be 
handled  by  the  usual  methods  described  in  Section  II.  A  general  method  that 
will  handle  such  formulas  is  knovm  as  the  method  of  differential  correction. 
This  method  is  an  iterative  process  for  which  initial  estimates  of  tit:  para¬ 
meters  are  required.  By  considering  the  approximate  values  of  the  constants 
and  the  corrections  to  each  of  these  values,  the  original  function  can  be 
expanded  in  a  Taylor’s  series,  which  will  be  linear  in  the  corrections  when 
terms  of  higher  order  than  the  first  are  ne  lected.  Either  the  method  of 
averages  or  the  method  of  least  squares  will  then  yield  the  values  of  the 
corrections.  Addition  of  the  corrections  to  the  starting  values  will  fur¬ 
nish  new  estimates  and  the  process  nay  be  repeated  fer  improved  accuracy. 

Modern  high-speed  computers  overcome  the  difficulties  once  pcsed  by  the 
rather  lengthy  calculations  necessitated  by  this  awthod.  Furthcmi  .-c,  the 
method  of  least  squares  is  usually  selected  for  machine  solution  of  the 
corresponding  regression  equation  because  accuracy,  ret’aer  than  ease  of  com¬ 
putation,  is  of  primary  concern.  The  Mthod  of  differential  correction  is, 
as  a  result,  often  referred  to  as  the  "general^ leest-«quercs  method  for  non- 
llneai  parameters." 

Tils  iterative  procedure  has  the  limitation  that  the  original  eetimates 
of  the  parameters  mutt  be  close  enough  to  the  true  values  that  higher  order 
terms  of  the  Taylor's  aeries  may  be  omitted  wlthou'  affecting  the  convergence 
of  the  procedure.  Starting  values  may  otten  be  obtained  from  a  knowledge 
of  the  physical  charecteristlce  of  the  problem  whose  data  ere  being  enelyted. 
Certain  graphical  techniques  analogous  to  those  discrlbed  in  Section  II 
will  also  be  helpful  for  certain  types  of  date,  if  an  analog  computer  is 
available,  It  can  furnish  starting  values  for  a  c*de  variety  of  functions, 
by  mlnlmiitng  the  area  between  a  series  of  line  segments  drawn  through  the 
dais  points,  end  the  curves  obtained  by  varying  r>>e  pecemetera  in  the 
equation  selected  for  representation  of  the  date. 

Although  the  method  of  differential  correction  la  often  reported  in  the 
literature,  U  is  not  slwayt  described  In  a  detailed  end  notetionelly  simple 
-ranner.  For  the  beiMfl.  .1  the  reader  not  (emillar  with  this  method,  the 
auiuo-  r.4s  in<.luded  a  susasary  of  the  Mterlal  presented  In  the  referencea 
ior  straightforward  descriptions  of  the  method,  the  reader  may  refer  to 
ginning^,  Scerborouglf  ,  and  Nielson*.  Hpre  theoretical  diecuasione  ere 
given  by  Cealng*  and  Cucst*^ .  Williams*'  und  Turner,  Monroe,  end  Lucet  ' 
ssplov  a  eM-.dificet ion  of  the  piucedure.  Speciflt-  applications  of  the 
p-c  ess  ere  repr-ted  by  HowelT*  and  Berkton  end  Cage**. 
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B.  EXPLANATION  OF  THE  METHOD 
Consider  a  function 

y  »  f  (x;  a,  b,  c),  (1) 

where  x  and  y  are  the  variables,  and  a,  b,  and  c  are  the  pataaelers.  Let 
ao,  bo,  Co  be  approxltaate  values  of  a,  b,  c,  and  let  A  a,  A  b,  Ac 
represent  corrections  to  these  Initial  values  such  that 

a  -  ao  +  A  a 

b  -  bo  +  A  b  (2) 
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whe  re 


R  =»  y  -  yo 


Ao  =  A  a 

Zo 

df 

3ao 

Ai  =*  A  b 

Zl 

- 

"  Sbo 

A2  “  A  c 
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af 

5co 

'.nls  equation  may  be  solved  by  the  usual  nethod  of  least  squares  to  obtain 
the  values  for  A  a,  A  b,  and  A  c. 


C  HOOIFICATION  OF  THE  METHOD 

Many  functions  contain  both  linaar  and  nonlinear  paraaeters.  Turner^* 
has  reported  a  nodification  of  the  general  least>equares  aethod  that  eliai- 
nates  the  necessity  of  using  any  trial  values  for  the  linear  paraaeters, 
thus  reducing  the  aaount  of  work  involved  in  the  solotivMt.  In  Turner's 
TMthod,  the  regression  equation  is  taken  froa  Equation  (3)  without  trans* 
posing  yj.  He  has  found  that  the  linear  paraeaters  «ay  then  be  coaputed 
directly,  thus  eliainating  the  teras  containing  partial  derivatives  with 
respect  to  the  linear  paraaeters.  For  exaaple,  if  f  (x;  a,  b,  c)  is  a 
(unction  in  which  a  and  b  are  nonlinear  paraaeters  and  c  is  a  linear  para* 
Tetcr,  the  regression  equation  will  be 


it 

where  and  are  evaluated  at  Sq  and  bo,  but  c  reaatns  v«*iable.  This 

vodiflcatlnn  considerably  slapliiles  the  general  i.athod  of  least  squares, 
by  requiring  initial  estlsutes  of  only  the  nonlinear  paraaeters,  end  by  per* 
victlng  the  direct  use  of  the  observed  y  velues,  rether  chan  requiring 
simulation  of  the  residuals  X  •>  y  -  yo. 


D.  UAHfUR 

Consider  the  problea  of  fitting  the  function 
{  (s:  a.  b)«ys  ab* 

to  a  set  c(  osperiaental  data.  Although  thla  function  containa  a  nonlinear 
paraaeter  b,  the  cquetion  doe*  happen  to  be  one  of  the  typea  for  which  a 


24 


linear  transformation  is  available.  The  regular  least -squares  procedure  for 
linear  parameters,  when  applied  to  Che  linearized  data,  will  supply  directly 
the  best  estimates  of  the  parameters,  and  hence  would  usually  be  employed 
fur  this  simple  formula.  It  is  interesting,  however,  Co  consider  the  appli¬ 
cation  of  the  method  of  differential  correction  to  the  esCimaClon  of  Che 
parameters  in  this  formula.  A  straight  line  may  be  fitted  by  eye  to  the 
linear  cran.. .ormation  of  the  data  to  obtain  starting  values  ao  and  ho.  By 
the  method  of  differential  correction,  we  wish  to  obtain  corrections  A  a,} 
and  A  bo  such  that  improved  estimates  will  be 

ai  -  ao  +  A  ao 

bl  ■  bo  +  A  bo 

If  ai  and  bi  still  do  not  taeet  the  requirements  for  accuracy,  they  may  then 
be  used  as  estimates  to  obtain  new  values  for  the  correcrlons,  and  hence 
improved  estimates 

a2  •  ai  ♦  A  ai 

02  •  bi  ♦  A  bl. 

The  process  may  be  repeated  as  many  times  as  necaasary  to  obtain  tha  desired 
precision.  The  two  methods  for  obtaining  the  values  of  the  corrections  are 
illustrated  below. 


Emample  1:  rittlng  y  ■  eb*  by  the  Ceneral  Least -Squares  Method  for 
Konllnear  parameters 


Given  the  function 

f  <*;  e,  b)  w  y  •  eb*, 

its  pertlal  dertvetivee  with  reepeet  to  the  peramtters  ere 
^  t  .  K  ftf  .  S-1 

n;  •  ^  •  js;  •  •• « • 

Then  the  llneerlsed  equation  Clquetion  (6)]  is 
y  -  yo  •  bo*  A  e  ♦  ep  K  bu“**  A  b, 


(») 


(10) 


where 


yp  •  f  (a;  eo,  b®)  •  e®  b©*. 
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if  ,/je  set 


R  =  y  -  Vo 

Aj  *  ^ 

■  A  b  Zj  »  Bq  X 

thr.  regression  equation  becomes 
R  *  Aq  Zq  +  Al  Zl 

and  may  be  solved  for  Aq  and  Ai  (l.e.,  the  values  of  the  corrections)  by 
the  usual  least-squares  procedure. 

Example  2:  Fitting  y  ■  ab^  by  the  Simplified  Method  for  Equations 
Containing  both  Linear  and  Nonlinear  Parameters 

Again  consider  Equation  (9).  Using  Turner's  method,  only  the  nca- 
llnear  parameter  b  need  be  assigned  a  starting  value  bo.  The  llncarlecd 
equation  [comparable  to  Equation  (B)]  takes  the  font 

y  ■  a  bo*  +  •  X  bo*”^  A  b  (11) 

Tf  we  set 

Bo  ■  a  Xo  “  ho* 

Bl  •  a  A  b  Xl  •  xbo*'^ 
tht  regression  equation  becoMS 
y  •  Bo  Xo  +  Bl  Xl. 

Tl.ls  equation  nay  he  solved  by  the  method  of  least  squares  to  obtain  Bo 
and  Di.  The  valui  fcr  Bo  yields  the  value  of  a,  which  in  turn,  can  be 
used  with  the  value  of  Bi  to  solve  for  A  b. 


E.  SPEEDING  THE  COHVERCSNCE 

In  nary  cases,  convergence  Is  slow  due  to  the  oscillations  of  the  para¬ 
meter  estlnates  about  the  true  value.  It  haa  been  found  that  these  oscil¬ 
lations  nay  be  damped  by  adding  only  a  fractional  part  of  the  corrections 
to  ths  parameter  estimates  whan  obtaining  now  watlmates.  Then  parameter 
art Inates  to  be  used  In  the  next  cycle  of  Iteration  will  be  of  the  fora 

al  •  #0  'h  9  A  ao 

bi  •  bg  +  9  A  bo 

where  9  will  usually,  but  not  always,  be  lets  than  one. 
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Wtv  stein^®  specifically  discusses  the  use  of  a  multiplier  to  accelerate 
convergence.  Turner^®,  Howell^®,  and  Will’^®  employ  such  a  factor  in  per¬ 
forming  the  least-squares  iterative  procedure.  Box''^  describes  the 
Princeton-IBM  704  program,  which  computes  the  optimum  value  :f  0  before 
obtaining  the  parameters  of  the  function  by  nonlinear  least  squares. 


F.  rTSCUbSION 


It  nay  be  noted  that  the  method  of  d'fferentlal  correction  is  a 
generalization  of  the  least-squares  process  we  commonly  use  for  equations 
Involving  only  linear  parameters.  This  general  method,  therefore,  may  be 
used  for  any  function,  whether  it  contains  linear,  nonlinear,  or  a  combina¬ 
tion  of  the  two  types  of  parameters.  In  particular,  if  the  general  least- 
squares  procedure  is  applied  to  an  equation  having  only  linear  parameters, 
and  the  initial  estimates  ao,  bo,  ''o  e*ch  taken  to  be  zero,  then  the 
corrections  ^  a,  A  b,  Ac  arrived  at  in  the  solution  are  simply  the  values 
a,  b,  c  of  the  parameters  themselves,  to  that  the  general  method  reduces  to 
the  special  case  wa  generally  employ.* 

Often  the  mathematical  model  for  en  experiment  is  written  in  terms  of  - 
a  differential  equation.  Such  an  equation  aometlmas  canno.  be  Integrated 
explicitly,  so  that  the  partial  derivat Ives  of  the  function  cannot  be 
obtained  by  direct  differentiation.  Box^^  suggests  that  If  small  changes 
are  made  In  each  of  the  parameters  in  turn,  the  numerical  value  of  the 
derivatives  may  be  calculated  from  the  dlfferencea. 

Unfortunately,  there  ere  timet  when  even  the  use  of  the  0  wltlpller 
does  not  improve  convergence  eetisfectorlly.  Vor  additional  coneideretlon 
of  the  convergence  problem,  the  reader  may  refer  to  the  paper  by  Box^'^,  who 
has  had  considerable  experience  with  the  nonllnaer  least -equares  process. 
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IV.  LINEAR  COMBIKATIOWS  OF  EXPONENTIALS 


n 


Experimental  data  are  frequently  to  be  well  represented  by  sums 

of  exponentials  of  the  general  form 

y  •  £  At  e*'>lX 
t-1 

As  a  result,  a  number  of  methods  fur  estimation  of  the  parameters  of  such 
a  fur.ctlon  have  been  Investigated  and  reported  in  the  literature.  Some  of 
these  methods  will  be  mentioned  briefly  here.  For  the  details,  direct 
reference  should  be  made  to  the  literature. 

Peurzeig  and  Tyler^*  have  illustrated  clearly  the  graphical  "peeling 
off"  procedure  explained  in  item  (14)  of  Table  I.  They  have  extended  the 
method  to  the  case  of  an  equation  containing  snre  then  two  terms,  but  the 
parameter  estimates  obtained  are  only  approximate.  A  method  developed  by 
Prony  and  discussed  by  Whittaker  and  Robinson^* ,  Comall^° ,  and  Cor^f ^.eld 
et  aj^^,  can  sasMttimea  be  used  to  advantage  in  obtaining  initial  estimates 
for  later  refinement  by  one  of  the  iterative  procedurea.  Iterative  nmximum 
likelihood  techniques,  one  of  which  is  similar  to  the  iterative  leaet- 
squares  procedure,  are  briefly  reviewed  by  Cornell*^. 

Taylot*'*  used  difference  equations  in  his  method  of  attacking  the 
problem.  The  parameter  estimates  obtained  by  his  process,  however,  have 
not  been  found  to  be  as  consistently  accurate  as  those  calculated  by  some 
of  the  other  methods. 

Cornell**’  hee  developed  an  effective  noniterative  procedure  for  fitting 
a  fairly  general  class  of  linear  combinations  of  exponentials  to  data  taken 
at  equally  spaced  i.itervele.  Me  derives  expreesitms  that  employ  as  msny 
sums  of  Che  observations  as  there  ere  peramsters  to  be  estimeted.  Mis 
method  has  two  advantages  over  Che  Prony  method;  it  does  not  place  a  limit 
on  the  number  of  observations,  and  it  does  not  require  least'squares 
calculations. 

Hartley's  "Internal  Least  Squares"*’  is  a  fairly  well-known  amthod  Chat 
can  be  applied  to  functions  having  linear  differential  equations,  end  hence 
to  linear  combinations  of  exponentials,  hocr.use  they  ere  generated  by 
linear  differential  equations.  The  linear  difference  equation  corraspendlng 
to  the  differehtlal  equation  la  integrated  numerically.  In  Che  resulting 
regression  equation  "the  dependent  variate  y  la  related  to  its  own  repeated 
auiM  as  independent  variables."**  The  principle  of  least  squares  is  then 
applied  to  this  regression  equation. 
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V.  ASYMPTOTIC  REGRESSION 


A.  INTRODUCTION 

,„=,r  W-.2!  =ri; 

'p;«d,  ;h.’''“ir«°4*iH^«.. 


B.  EStmtlOH  or  T«  PARAWTEBS 

TH.  -th.d.  .V..UM.  *.;^“u>. 

-n:i  r.:.r  ;::iutr;r.  .r^-r  H-E^Tw 

Mt«  v«lu«i  of  a  and  0  fro«  tha  Itnaar  ra|ta»iloi>  of  oo/f  . 

•«.-  fam  V  •  A  [l-lO-«(**’^)]  of  tha  Bodiflad  anpowintlal  la  known  aa 

ao:  ;:rUa:M:r:idaI,  an-  fo«nd  -ry  .«.jcabl. 

GoMa'  and  Stavana*  tabUi  haa  boon  provldad  by  iyrd,  Jonaa,  ft  at. 
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y  ■  a  +  * 
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Figure  S.  Graph  of  Hodifled  Exponential  Curve. 
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Figure  7.  Graph  of  Leglatlc  Curvee. 
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D,  LOGISTIC  CURVE 

One  of  the  most  widely  used  asymptotic  curves  is  the  logistic,  whose 
equation  is 


y  *  l+?»^ 


Taking  reciprocals  will  transfomi  Che  logistic  to 

i.  i*i..  (.b,» 

which  it,  again,  a  modified  exponential.  Such  a  CransformeClon  la  rarely 
performed,  however,  because  Che  ...ethods  of  dealing  with  Che  logiaclc  iceelf 
are  so  numerous. 

One  of  the  simplest  and  best  known  ways  of  obtaining  parameter  estimates 
is  the  transformation  of  the  equation  into  the  form 


In  •  a  >  bx 

which  it  a  straight  line  on  seal •logarithmic  graph  paper  whan  x  is  plotted 
against  (k*y)/y.  Fearr^  and  Reed  and  Berksoir*  have  illustrated  this 
method  of  fitting,  and  have  alto  given  good  discuesions  of  the  general 
properties  of  the  logistic  function. 

Berkeen**'**  has  provided  graph  paper  scaled  in  such  a  way  that  the 
linear  relation  may  be  obtained  by  plotting  x  directly  against  y/k. 

Berkson's  "logit  paper*'  it  similar  to  an  earlier  logistic  grid  reported  by 
Wilsoh**.  Hodges’*  explains  a  "transfer  mmthod  which  can  be  used  to  improve 
the  eatlmstee  obtained  from  the  linear  plot  on  logit  paper."  Barkson**'** 
has  also  provided  cablsa  for  the  logic  and  antllogit,  to  facilitats  the 
determination  of  the  maxiauai  likelihood  estimate  of  the  logistic  function. 

Several  other  linear  trensfoimations  of  the  logistic  have  bean  developed. 
Hair**  gives  e  very  good  suasry  of  five  t tans foraat ions  chat  are  baaed  on 
the  differential  equation  of  the  logistic  or  on  some  form  of  the  corresponding 
difference  equation. 
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A  generalization  of  the  logistic  sometimes  provides  a  better  fit  for 
a  set  of  experimental  data.  Terms  may  be  added  to  the  exponential  to  alter 
the  shape  of  the  curve  apprcirlately .  The  general  logistic  is  given  in  the 
form 


^  1  ga+bxfcx^. . . 

where  the  number  of  terms  in  the  exponent  is  detctsdncd  by  the  type  of  fit 
desired.  Figure  7  illustrates  the  difference  between  a  simple  logistic 
curve  and  a  more  general  logistic,  whore  exponential  is  quadratic.  3ailey*^ 
has  published  some  tables  for  use  in  fitting  the  generalized  logistic. 

There  are  many  other  equations  that  can  be  used  to  represent  symmetrical 
growth  curves  bssldes  that  of  the  logistic  function.  Questions  often  arire 
as  to  which  equation  would  b:  most  appropriate  for  a  certain  set  of  data. 

An  interesting  article  by  Ulnsor*^  cosiperes  the  logistic  with  the  integrated 
normal  curve,  the  arc-tangent  curve,  and  the  integrated  Peareon  Type  VIT 
curve. 
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VI.  FOLYNOMULS 


The  use  of  polynomials  In  curve  fitting  .'s  extremely  connon  end  easily 
achieved.  The  method  of  least  squares  yields  parasteter  estlsiates  immediately, 
because  all  parimcters  in  the  polynomial  equation  are  linear.  When  the  x 
values  of  the  data  are  equally  spaced,  the  successive  differences  jf  the 
y  values  may  be  examined  as  an  aid  in  determining  the  appropriate  number 
of  terms  for  a  polynomial  that  will  represent  the  data  adequatcl/  .  Under 
this  condition,  if  the  n^^  differences  of  the  y'a  are  constant,  then  the 
last  term  in  the  required  polynomisl  will  be  x**.  It  is  advsntageous  to  use 
orthogonal  polynomial  equations  when  the  degree  of  the  polynomial  needed  to 
fit  the  data  satisfactorily  is  not  known.  Orthogonal  polynomials  enable 
higher  degree  terms  to  be  added  to  the  equation  without  changing  the  coeffi¬ 
cients  of  the  previously  fitted  polynomisl.  Moat  textbooks  describe  the 
technique  of  using  orthogonal  polynomials  in  curve  fitting.  Paredlne  and 
Rivet c** ,  in  particular,  give  a  good  exposition  of  the  subject. 

Because  of  the  generel  femtliarity  with  the  wwthods  of  polynomial  rcgr-s- 
Sion,  polynomials  are  often  overused  for  curve  fitting.  In  spite  of  the 
fact  that  a  polynomial  can  be  found  to  fit  any  set  of  date  if  a  sufficient 
number  of  terms  are  taken,  there  Is  often  very  little  biological  or  physical 
significance  for  the  parameters  of  the  resulting  polynomial.  Considerable 
thought  should  be  giver,  therefore,  to  the  physical  basis  of  the  problem, 
in  order  to  determine  'itMther  there  is  any  amrlt  In  using  a  polynomial  to 
represent  it.  Sometimes,  of  course,  It  Is  helpful  to  use  a  polynomial  as 
an  approximation  to  another  function  for  which  the  direct  estimation  of  the 
parameters  would  have  been  difficult  or  Impossible. 


FREQUENCY  DISTRIBUTIONS 


The  general  topic  of  frequency  distributions  is  veil  known  to  statis¬ 
ticians  and  fully  covered  in  statistical  textbooks.  It  seems  unnecessary, 
therefore,  to  include  in  this  paper  any  lengthy  discussion  of  the  subject. 

An  important  class  of  frequency  functions  arising  from  Che  solutions 
of  a  certain  first-order  differencial  equation  is  Che  Pearsonian  system  of 
curves.  A  comprehensive  treatment  of  this  family  of  curves  is  given  by 
Eldertoh*^  .  Other  good  references  on  the  subject,  in  addition  to  the 
writings  of  Karl  Pearson  himself,  are  Che  discussions  given  by  Peters  and 
Van  Voorhis**  .  Kenney  and  Keeping*^,  Carver**,  Craig**,  and  Kcndallf * . 


The  familiar  normal  curve  is  one  of  the  types  of  curves  included  in 
Che  Pearsonian  system.  For  a  discussion  of  the  use  of  "probability  paper" 
to  effect  a  linear  transformation  of  the  integrated  normal  curve,  reference 
may  be  made  to  Finney's  Probtt  Analysis*^ . 

Methods  of  parameter  estimation  commonly  used  for  frequency  distri¬ 
butions  arc  the  method  of  awments,  the  principle  of  maximum  likelihood, 
and  Che  minimum  chi-squared  process.  These  procedures  sre  presented  in 
standard  textbooks  and  in  the  references  given  for  the  Pearsonian  curves. 

It  should  be  noted  chat  the  use  of  these  methods  is  not  restricted  to  the 
fitting  of  frequency  distributions.  An  application  of  the  mathod  of 
iraxlmum  likelihood  to  curve  fitting  In  general  as  reviewed  by  Cornell**, 
and  O'Toole**  applies  the  mathod  of  moments  to  a  fairly  general  class  of 
functions. 
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VIII.  SUMMARY 


-his  paper  Is  offered  as  a  guide  for  t'.. se  persons  who  seek  to  represent 
experimental  data  by  some  appropriate  mathematical  expression.  For  straight- 
line  transformations  and  the  general  method  of  differential  correction,  It 
should  be  possible  to  work  directly  from  this  paper.  For  the  other  methods 
tr.-ated  in  less  detail^  the  discussion  should  serve  to  indicate  the  literature 
available  on  the  particular  topic  of  Interest  to  the  experimenter. 

It  is  emphasised  that  this  paper  deals  with  only  the  various  types  of 
curves  and  the  matheMtical  techniques  for  fitting  them  to  experimental 
data.  An  experimauter  needs  such  general  information  as  a  background  for 
the  mathematical  interpretation  of  his  problem.  Another  extremely  important 
consideration,  however,  is  the  biological  or  physical  significance  of  the 
data  being  analysed.  No  mathematical  expression  can  be  C'?nsidercd  adequate 
for  representing  experimental  data  unless  it  has  evolved  irom  the  physical 
basis  of  the  problem.  Such  scientific  analysis  of  the  problem  should  be 
conducted  before  the  techniques  of  curve  fitting  are  applied. 
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